RECEIVED
MAR 13 2002
Technology Center 2100

<110> APPLICANT: Tomikawa, Mayumi
      Aikawa, Seiichi
      Matsuzawa, Fumiko
<120> TITLE OF INVENTION: Method and Apparatus for Extracting and Evaluating Mutually Similar Portions in One-Dimensional Sequences in Molecules and/or Three-Dimensional Structures of Molecules
<130> FILE REFERENCE: 522.1921D2
<140> CURRENT APPLICATION NUMBER: 09/910,071
<141> CURRENT FILING DATE: 2001-07-23
<150> PRIOR APPLICATION NUMBER: US 08/014,867
<151> PRIOR FILING DATE: 1993-02-08
<160> NUMBER OF SEQ ID NOS: 20
<170> SOFTWARE: PatentIn version 3.1

<210> SEQ ID NO: 1
<211> LENGTH: 37
<212> TYPE: PRT
<213> ORGANISM: human
<400> SEQUENCE: 1

Gly Asp Val Glu Lys Gly Lys Lys Ile Phe Ile Met Lys Cys Ser Gln
1               5                   10                  15

His Thr Val Glu Gly Gly Lys His Lys Thr Gly Pro Asn Leu His Gly
            20                  25                  30

Leu Phe Gly Arg Lys
        35

<210> SEQ ID NO: 2
<211> LENGTH: 39
<212> TYPE: PRT
<213> ORGANISM: bacterium
<400> SEQUENCE: 2

Glu Gly Asp Asp Ala Ala Ala Gly Glu Lys Val Ser Lys Lys Cys Leu
1               5                   10                  15

Ala Cys His Thr Phe Asp Gln Gly Gly Ala Asn Lys Val Gly Pro Asn
            20                  25                  30

Pro Asn Leu Phe Gly Val Phe
        35

<210> SEQ ID NO: 3
<211> LENGTH: 51
<212> TYPE: PRT
<213> ORGANISM: rat
<400> SEQUENCE: 3

Met Ser Leu Ala Ile Leu Arg Val Ile Arg Leu Val Arg Val Phe Arg
1               5                   10                  15

Ile Phe Lys Leu Ser Arg His Ser Lys Gly Leu Gln Ile Leu Gly Arg
            20                  25                  30

Thr Leu Lys Ala Ser Met Arg Glu Leu Gly Leu Leu Ile Phe Phe Ile
        35                  40                  45

Gly Val Val
    50

<210> SEQ ID NO: 4
<211> LENGTH: 142
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: Figure 23A
<400> SEQUENCE: 4

Thr Glu Glu Gln Ile Ala Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp
1               5                   10                  15

Lys Asp Gly Asp Gly Thr Ile Thr Thr Lys Glu Leu Gly Thr Val Met
            20                  25                  30

Arg Ser Leu Gly Gln Asn Pro Thr Glu Ala Glu Leu Gln Asp Met Ile
        35                  40                  45

Asn Glu Val Asp Ala Asp Gly Asn Gly Thr Ile Asp Phe Pro Glu Phe
    50                  55                  60

Leu Thr Met Met Ala Arg Lys Met Lys Asp Thr Ser Glu Glu Glu Ile
65                  70                  75                  80

Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr Ile Ser
                85                  90                  95

Ala Ala Glu Leu Arg His Val Met Thr Asn Leu Gly Glu Lys Leu Thr
            100                 105                 110

Asp Glu Glu Val Asp Glu Met Ile Arg Glu Ala Asn Ile Asp Gly Asp
        115                 120                 125

Gly Gln Val Asn Tyr Glu Glu Phe Val Gln Met Met Thr Ala
    130                 135                 140


<210> SEQ ID NO: 5
<211> LENGTH: 159
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: Figure 23B
<400> SEQUENCE: 5

Ala Met Asp Gln Gln Ala Glu Ala Arg Ala Phe Leu Ser Glu Glu Met
1               5                   10                  15

Ile Ala Glu Phe Lys Ala Ala Phe Asp Met Phe Asp Ala Asp Gly Gly
            20                  25                  30

Gly Asp Ile Ser Thr Lys Glu Leu Gly Thr Val Met Arg Met Leu Gly
        35                  40                  45

Gln Asn Pro Thr Lys Glu Glu Leu Asp Ala Ile Ile Glu Glu Val Asp
    50                  55                  60

Glu Asp Gly Ser Gly Thr Ile Asp Phe Glu Glu Phe Leu Val Met Val
65                  70                  75                  80

Arg Gln Met Lys Glu Asp Ala Lys Gly Lys Ser Glu Glu Glu Leu Ala
                85                  90                  95

Asp Cys Phe Arg Ile Phe Asp Lys Asn Ala Asp Gly Phe Ile Asp Ile
            100                 105                 110

Glu Glu Leu Gly Glu Ile Leu Arg Ala Thr Gly Glu His Val Thr Glu
        115                 120                 125

Glu Asp Ile Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly
    130                 135                 140

Arg Ile Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly Val Gln
145                 150                 155

<210> SEQ ID NO: 6
<211> LENGTH: 28
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: calmodulin probe site 81-108, Figure 25, target
<400> SEQUENCE: 6

Leu Ala Asp Cys Phe Arg Ile Phe Asp Lys Asn Ala Asp Gly Phe Ile
1               5                   10                  15

Asp Ile Glu Glu Leu Gly Glu Ile Leu Arg Ala Thr
            20                  25

<210> SEQ ID NO: 7
<211> LENGTH: 28
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: calmodulin probe site 81-108, probe, Figure 25
<400> SEQUENCE: 7

Ile Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr Ile
1               5                   10                  15

Ser Ala Ala Glu Leu Arg His Val Met Thr Asn Leu
            20                  25

<210> SEQ ID NO: 8
<211> LENGTH: 27
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: Calmodulin probe site 117-143, Figure 26, target
<400> SEQUENCE: 8

Ile Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly Arg Ile
1               5                   10                  15

Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly
            20                  25

<210> SEQ ID NO: 9
<211> LENGTH: 27
<212> TYPE: PRT
<213> ORGANISM: unknown
<400> SEQUENCE: 9

Val Asp Glu Met Ile Arg Glu Ala Asn Ile Asp Gly Asp Gly Gln Val
1               5                   10                  15

Asn Tyr Glu Glu Phe Val Gln Met Met Thr Ala
            20                  25

<210> SEQ ID NO: 10
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: Coordinates of C-alpha corresponding to the amino acid residue NOS. 7 to 14 in elongation factor of protein which is a binding site for phosphoric acid of GTP
<400> SEQUENCE: 10

Gly His Val Asp His Gly Lys Thr
1               5

<210> SEQ ID NO: 11
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: adenylate kinase - target
<400> SEQUENCE: 11

Gly Ala Pro Gly Ser Gly Lys Gly
1               5

<210> SEQ ID NO: 12
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: ras protein - target
<400> SEQUENCE: 12

Gly Ala Gly Gly Val Gly Lys Ser
1               5

<210> SEQ ID NO: 13
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: adenylate kinase (3ADK)
<400> SEQUENCE: 13

Gly Gly Pro Gly Ser Gly Lys Gly
1               5

<210> SEQ ID NO: 14
<211> LENGTH: 223
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: amino acid sequence of trypsin
<400> SEQUENCE: 14

Ile Val Gly Gly Tyr Thr Cys Cys Ala Asn Thr Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Ser
            20                  25                  30

Gln Trp Val Val Ser Ala Ala His Cys Tyr Lys Ser Gly Ile Gln Val
        35                  40                  45

Arg Leu Gly Glu Asp Asn Ile Asn Val Val Glu Gly Asn Glu Gln [illegible]
    50                  55                  60

Ile Ser Ala Ser Lys Ser Ile Val His Pro Ser Tyr Asn Ser Asn Thr
65                  70                  75                  80

Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Lys Ser Ala Ala Ser Leu
                85                  90                  95

Asn Ser Arg Val Ala Ser Ile Ser Leu Pro Thr Ser Cys Ala Ser Ala
            100                 105                 110

Gly Thr Gln Cys Leu Ile Ser Gly Trp Gly Asn Thr Lys Ser Ser [illegible]
        115                 120                 125

Thr Ser Tyr Pro Asp Val Leu Lys Cys Leu Lys Ala Pro Ile Leu Ser
    130                 135                 140

Asp Ser Ser Cys Lys Ser Ala Tyr Pro Gly Gln Ile Thr Ser Asn [illegible]
145                 150                 155                 160

Phe Cys Ala Gly Tyr Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly [illegible]
                165                 170                 175

Ser Gly Gly Pro Val Val Cys Ser Gly Lys Leu Gln Gly Ile Val [illegible]
            180                 185                 190

Trp Gly Ser Gly Cys Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys
        195                 200                 205

Val Cys Asn Tyr Val Ser Trp Ile Lys Gln Thr Ile Ala Ser Asn
    210                 215                 220


<210> SEQ ID NO: 15
<211> LENGTH: 239
<212> TYPE: PRT
<213> ORGANISM: unknown
<220> FEATURE:
<223> OTHER INFORMATION: amino acid sequence of Elastase
<400> SEQUENCE: 15

Val Val Gly Gly Thr Glu Ala Gln Arg Asn Ser Trp Pro Ser Gln Ile
1               5                   10                  15

Ser Leu Gln Tyr Arg Ser Gly Ser Ser Trp Ala His Thr Cys Gly Gly
            20                  25                  30

Thr Leu Ile Arg Gln Asn Trp Val Met Thr Ala Ala His Cys Val Asp
        35                  40                  45

Arg Glu Leu Thr Phe Arg Val Val Val Gly Glu His Asn Leu Asn Gln
    50                  55                  60

Asn Asn Gly Thr Glu Gln Tyr Val Gly Val Gln Lys Ile Val Val Pro
65                  70                  75                  80

Tyr Trp Asn Thr Asp Asp Val Ala Ala Gly Tyr Asp Ile Ala Leu Leu
                85                  90                  95

Arg Leu Ala Gln Ser Val Thr Leu Asn Ser Tyr Val Gln Leu Gly Val
            100                 105                 110

Leu Pro Arg Ala Gly Thr Ile Leu Ala Asn Ser Pro Cys Tyr Ile Thr
        115                 120                 125

Thr Gly Trp Gly Leu Thr Arg Thr Asn Gly Gln Leu Ala Gln Thr Leu